RESEARCH 


® 


MICROBIOLOGY B. bigelowii cell architecture and Check for 
organelle division | Updates | 


Nitrogen-fixing organelle in a marine alga 
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Wing Kwan Esther Makt, Shunyan Cheung’, Axel Ekman?, Jian-Hua Chen??, Kyoko Hagino”, 
Yoshihito Takano”, Tomohiro Nishimura”, Masao Adachi’, Mark Le Gros”*, 


Carolyn Larabell”?, Jonathan P. Zehr'* 


Symbiotic interactions were key to the evolution of chloroplast and mitochondria organelles, which 
mediate carbon and energy metabolism in eukaryotes. Biological nitrogen fixation, the reduction of 
abundant atmospheric nitrogen gas (N2) to biologically available ammonia, is a key metabolic process 
performed exclusively by prokaryotes. Candidatus Atelocyanobacterium thalassa, or UCYN-A, is a 
metabolically streamlined N2-fixing cyanobacterium previously reported to be an endosymbiont of a 
marine unicellular alga. Here we show that UCYN-A has been tightly integrated into algal cell architecture 
and organellar division and that it imports proteins encoded by the algal genome. These are 
characteristics of organelles and show that UCYN-A has evolved beyond endosymbiosis and functions 
as an early evolutionary stage No-fixing organelle, or “nitroplast.” 


ymbioses, the living together of two or 

more species, involve varied metabolic 

interactions and diverse eukaryotic and 

prokaryotic organisms (1). Such inter- 

actions between unicellular organisms 
were key for evolution of the organelles that 
are the basis of eukaryotic life on Earth, in- 
cluding the metabolic innovations of aerobic 
respiration and photosynthesis of mitochondria 
and chloroplasts, respectively (2, 3). Despite 
the success of such organelles, primary endo- 
symbiosis remains exceptionally rare, with 
only three occurrences ever described (4). 
Biological nitrogen (N3) fixation, the reduction 
of abundant inert atmospheric N, gas to am- 
monia, is a key metabolic innovation that main- 
tains the fertility of terrestrial and aquatic 
environments. N, fixation in eukaryotes is 
only known to exist through diverse sym- 
bioses with Bacteria or Archaea, the microbes 
capable of N, fixation (5). Several presumed 
endosymbiotic interactions have been described 
that involve unicellular eukaryotic algae and 
N,-fixing intracellular spheroid bodies that are 
evolutionarily related to cyanobacteria (6), in- 
cluding diatom symbionts (7) and the Candidatus 
Atelocyanobacterium thalassa (UCYN-A) sym- 
biosis with a haptophyte alga (8). However, it 
has remained unclear where these symbiotic 
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relationships fall along the spectrum of endo- 
symbiosis to organelle, and no N,-fixing organ- 
elle has yet been discovered (9, 10). 

The UCYN-A symbiosis is between closely 
related strains of a metabolically streamlined 
N.-fixing cyanobacterium (UCYN-A) and the 
marine haptophyte algae Braarudosphaera 
bigelowti and relatives (8, 11, 12). Different 
strains include coastal and open-ocean eco- 
types (13) that are widely distributed through- 
out the global ocean, including polar waters 
(14), and are important in global marine N3 
fixation (15). Both ecotypes have been shown 
to have some characteristics of an organelle, 
including metabolic streamlining as a result 
of genome reduction, evidenced by the loss of 
carbon fixation and oxygenic photosynthesis 
genes, and the entire tricarboxylic acid cycle 
(16). It has also recently been shown that the 
sizes of the partners in the different strains 
have an organelle-like size relationship (17). 
However, key biological features of organelles, 
such as coordinated cell division, protein im- 
port by the endosymbiont, or gene exchange 
between partner cells, as have been shown in 
the plastid-like chromatophore of the protist 
Paulinella (4), have never been demonstrated 
in UCYN-A or any of the N.-fixing spheroid 
bodies in diatoms (9). A recently obtained sta- 
ble culture of B. bigelowti/UCYN-A (the coastal 
strain, UCYN-A2) was used to interrogate UCYN- 
A for organelle-like characteristics. Here we 
present results of soft x-ray tomography (SXT) 
(18) and proteomic analyses showing that (i) 
UCYN-A is integrated into the B. bigelowii cell 
architecture and divides in a defined synchronous 
sequence with the organelles and (ii) it receives 
proteins that are encoded by the B. bigelowii 
genome and targeted to UCYN-A, both of which 
are criteria commonly used to distinguish or- 
ganelles from endosymbionts (19). Thus, UCYN-A 
appears to have molecular and cellular char- 
acteristics of an early stage N.-fixing organelle. 


UCYN-A is amembrane-enclosed spherical body 
[which is enclosed in an internal B. bigelowii 
membrane (72)] that lies at the posterior end 
of the B. bigelowii cell, opposite the end with 
the haptonema and between the two chloro- 
plasts (Fig. 1, A and B, and fig. S3A). UCYN-A 
cannot fix CO,, as it has lost the genes for 
carbon fixation and oxygenic photosynthesis, 
but has a complete set of nif genes and fixes 
Ns (16, 20). Isotopic tracer studies showed that 
fixed N is rapidly transferred to the B. bigelowii 
cell, and B. bigelowii fixes CO, into metabolites 
that are taken up by UCYN-A (8). The intra- 
cellular location of UCYN-A with respect to 
the haptophyte organelles is likely to be crit- 
ical for the exchange of numerous metabolites 
(21) and regulation of oxygen flux given that 
UCYN-A is known to only fix N, during the day 
when the haptophyte is evolving oxygen (22) 
and the N.-fixing enzyme, nitrogenase, is sen- 
sitive to inactivation by oxygen (23). There is 
only a single UCYN-A per B. bigelowii cell, so 
division of UCYN-A must be coordinated with 
the B. bigelowii eukaryotic cell to maintain the 
unicellular symbiosis (24). 

We determined the three-dimensional (3D) 
structure of B. bigelowii and dynamics of in- 
tracellular body division with SXT (Fig. 1 and 
fig. S1). Prior imaging studies used 2D tech- 
niques, such as transmission electron micro- 
scopy (11, 12), that did not resolve the precise 
location of UCYN-A relative to the organelles 
or determine how UCYN-A divides relative 
to other B. bigelowii cell constituents. SXT 
not only resolves the 3D structure of cel- 
lular components but also measures changes 
in their density and composition using the 
linear absorption coefficient (LAC) (25). Using 
SXT, hundreds of intact individual specimens 
were imaged, and 49 were segmented for vol- 
ume and LAC measurements to quantify cell 
cycle changes in the structural organization 
and molecular density of cellular components 
throughout the daily light-dark cycle (see sup- 
plementary methods). 

UCYN-A is a prominent intracellular body 
(15.2 + 3.3 um?) which occupies 8.6% of the 
intracellular space (Fig. 1A and tables S1 and S3). 
The mitochondrial network wraps almost en- 
tirely around UCYN-A and separates it from 
the nucleus (fig. S2A and movie S1). Quantita- 
tive analysis of the 3D SXT data showed that 
the basic structural organization remains rel- 
atively constant throughout the light cycle, when 
UCYN-A is fixing N, (Fig. 1A and fig. S3C) (22). The 
positioning of mitochondria around UCYN-A is 
similar to mitochondrial-organelle interactions 
seen in other cells at times of increased energy 
demand, such as in mammalian neurons and 
pancreatic cells (26, 27). The proximity and ar- 
rangement of these structures suggests that the 
large adenosine triphosphate demand of Nə 


1 of 6 


RESEARCH | RESEARCH ARTICLE 


Fig. 1. B. bigelowii/UCYN-A light- 
dark cycle is highly coordinated. 
(A) Structural characterization 
during the light cycle: (Left) Differ- 
ential interference contrast (DIC) 
micrograph displays the whole alga 
and showcases UCYN-A spheroid 
body (SB) and plastids (Pls) in the 
FR21 culture; the correspondent 
4'.6-diamidino-2-phenylindole 
(DAPI) staining reveals the cell 
nucleus (Nu). In both images, the 
UCYN-A spheroid body (SB) is 
identified above the nucleus. Scale 
bars, 5 um. (Middle) Orthoslice from 
SXT data; color-coded 3D render- 
ings of subcellular components. 
(Right) LAC (molecular density) and 
volume measurements of UCYN-A 
and organelles. (B) 3D SXT render- 
ings at intervals during the dark 
cycle (left to right) depict the 
replication and fission of UCYN-A 
and B. bigelowii organelles; quantifi- 
cation of changes in volumes and 
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organelle volume and LAC are 


reported as follows: mitochondrial volume, ****P < 0.0001, ***P = 0.005, **P = 0.0035, and *P = 0.0188 (Tukey's multiple comparison test); mitochondria LAC, 
****P < 0.0001, ***P = 0.0004 (Tukey's multiple comparison test); UCYN-A volume, ***P = 0.0009 and *P = 0.0133 (Dunn's multiple comparison test); UCYN-A 
LAC, ****P < 0.0001 and ***P = 0.0005 (Tukey's multiple comparison test); nuclear volume, ****P < 0.0001, **P = 0.0023 (Tukey's multiple comparison test); 


nuclear LAC, *P = 0.0105 (Dunn's multiple compariso 
0.0396 (Dunn's multiple comparison test). 


fixation in UCYN-A is supplied by the closely 
associated mitochondria. 

The process of mitotic cell replication (Fig. 1B), 
which occurs during the dark cycle, was initiated 
with replication of the mitochondria. The mito- 
chondrial volume of cells examined midway 
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through the dark cycle was twice that of cells 
examined at the beginning of the dark cycle 
(Fig. 1C). The mitochondrial network divides 
into two segments (fig. S2B and movie S2), 
each with a decreased LAC value suggesting 


the presence of fewer cristae and, therefore, 
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n test); plastid volume, ***P = 0.0008 and **P = 0.0011 (Dunn's multiple comparison test); plastid LAC, *P = 


reduced cellular respiration. Replication and 
fission of UCYN-A proceeded in the absence of 
surrounding mitochondria, which were at this 
point concentrated between the B. bigelowii 
nucleus and plastids. There was minimal volu- 
metric change of UCYN-A during replication, 
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Fig. 2. B. bigelowii contribution to the UCYN-A proteome. (A) Uniform 
manifold approximation and projection (UMAP) clustering analysis reconstructed 
from all detected protein relative abundances from all samples. B. bigelowii- 
encoded proteins shown as circles, UCYN-A encoded as triangles. Icon size 
indicates average relative abundance in all samples. The left plot shows individual 
proteins colored by their average logs fold change (FC) between UCYN-A and 
B. bigelowii samples. Right plot shows proteins colored by their genome of origin. 
(B) Selected B. bigelowii proteins detected in UCYN-A indicated with white 
bars, and black segments correspond to mass spectrometry-identified peptides. 


Top five BlastP hits are shown as bars colored by taxonomic origin. Motifs identified 
by MEME block diagram shown as colored blocks (see supplementary methods). 
ThrC, threonine synthase. (C) Average contribution to UCYN-A proteome by UCYN-A 
and B. bigelowii-encoded proteins in day and night samples. Only proteins significantly 
more abundant in UCYN-A samples (DESeq2 Wald test P < 0.05) are included. 

(D) The comprehensive UCYN-A proteome. Numbers indicate count of unique proteins 
in each category. “Detected” refers to identification in the proteomic dataset. 
“Enriched” indicates statistically more abundant in UCYN-A samples (DESeq2 Wald 
test P < 0.05). uTP indicates UCYN-A transit peptide detected by HMM search. 


but fission was made obvious by the presence of 
numerous cells with two distinct but smaller 
UCYN-A bodies (9.9 + 1.8 um? and 5.0 + 0.7%; 
table S2). There was also an ~18% decrease in 
LAC value for each UCYN-A (Fig. 1C and table 
S2), suggesting that the newly formed UCYN-A 
are less densely packed immediately after fis- 
sion. As UCYN-A completed division, the 
B. bigelowii nuclei replicated, which is shown 
by a 37.5% increase in the percent volume (Fig. 
1C and fig. S2C) and decreased LAC values, 
indicating that less volume is required for the 
replicated DNA than for the new nuclear en- 
velope and nucleoplasm. As duplication of the 
nucleus continued, distinct invaginations in both 
B. bigelowii plastids were detected, indicat- 
ing the onset of plastid replication and fission 
(Fig. 1B and fig. S2D). Fission of the plastids 
and B. bigelowii nucleus occurred simultane- 
ously, followed by cell division (movie S3). Cell 
division appears to be first led by migration of 
the organelles and then completed with the 
migration of UCYN-A. The precise sequence of 
division of UCYN-A along with B. bigelowii 
organelles shows that UCYN-A is integrated 
into cell growth and division, and the coor- 
dination implies control by B. bigelowii, which 
was previously suggested by whole-genome ex- 
pression analysis (24). Such integration is a 
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required evolutionary step along the trajectory 
of evolution from an endosymbiont to an or- 
ganelle (28, 29). The SXT results show not only 
that UCYN-A is distinctly positioned with re- 
spect to organelles in the B. bigelowii cell but 
also that it participates in synchrony with 
B. bigelowii organelle and cell division, which 
ensures that UCYN-A is vertically transmitted 
to daughter cells along with the organelles. 


UCYN-A imports B. bigelowii 
nuclear-encoded proteins 


To investigate the contribution of B. bigelowii 
genome-encoded proteins to the UCYN-A pro- 
teome, the protein composition of UCYN-A was 
determined by comparing protein abundances 
in the whole symbiosis and isolated UCYN-A 
bodies. B. bigelowii cells and isolated UCYN-A 
were obtained by filtration, cell disruption, and 
density gradient centrifugation (fig. S3B). Puri- 
ty of isolated UCYN-A was determined by light 
microscopy, flow cytometry, and digital droplet 
polymerase chain reaction targeting specific 
nuclear and UCYN-A markers (figs. S3B and 
S4, A to D), which was then corroborated by 
the proteomic profiling. For example, during the 
day, the ribulose-1,5-bisphosphate carboxylase/ 
oxygenase (RuBisCO) protein RbcL was >2200 
times more abundant in whole B. bigelowii 


samples, whereas the nitrogenase iron protein 
(NifH) was 37 times more abundant in UCYN- 
A samples (table S4). Using predicted proteins 
from the UCYN-A genome and B. bigelowii geno- 
mic and transcriptomic datasets (12), 1804: total 
proteins were detected, including 609 encoded in 
the UCYN-A genome and 1156 from B. bigelowii. 
Proteins that were more abundant in isolated 
UCYN-A relative to B. bigelowii (DESeq2 Wald 
test, P < 0.05) were considered enriched and 
assigned to the comprehensive UCYN-A pro- 
teome used in further analyses. 

Most proteins encoded by either genome 
were more abundant in their respective sample 
type (data S1 and S2 and fig. S3E), but many 
B. bigelowti-encoded proteins were found in 
UCYN-A (Fig. 2A and fig. S4E). When all pro- 
teins significantly enriched in UCYN-A samples 
were weighted by their abundance, B. bigelowii- 
encoded proteins were found to make up about 
28% of the UCYN-A proteome during the 
day and 11% at night (Fig. 2C). In total, 368 
B. bigelowii-encoded proteins were determined 
to be more abundant in UCYN-A samples. The 
many nuclear-encoded proteins contained in 
UCYN-A indicate import from the surround- 
ing B. bigelowii cell. 

Putatively imported protein sequences had 
characteristics of import machinery associated 
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Fig. 3. B. bigelowii participation in UCYN-A metabolic pathways and daytime 
No fixation. (A) UCYN-A and B. bigelowii genes in UCYN-A metabolic pathways. 
Green outline indicates a UCYN-A-encoded gene. Purple outline indicates a uTP 
sequence but no protein detected. Purple fill indicates that a B. bigelowii protein 
was identified by a significant increase in protein abundance (DESeq2 Wald test, 

P < 0.05) in UCYN-A samples and the presence of a uTP sequence. Black outline 
shows genes missing from the UCYN-A genome and lacking a uTP-containing 
B. bigelowii homolog. See data S7 for abbreviations. (B) Relative abundance of 


with translocation to organelles. Many of the 
B. bigelowii-encoded proteins present in UCYN-A 
were considerably longer than orthologous pro- 
teins owing to a C-terminal extension (Fig. 2B). 
Alignment of these proteins revealed a series of 
conserved motifs within a region lacking any 
known protein domains (fig. S5). Analysis of 
proteomic mass spectrometry data failed to in- 
dicate peptides matching this C-terminal region, 
potentially indicating that it is cleaved or un- 
translated (Fig. 2B). The presence of this 
sequence within all predicted B. bigelowii 
proteins was determined using a custom hidden 
Markov model (HMM) constructed from 233 
B. bigelowii protein sequences enriched in 
UCYN-A samples (fig. S5B). We detected the 
sequence in 258 of the 368 B. bigelowii- 
encoded proteins enriched in UCYN-A, which 
accounted for 62.6% of all B. bigelowii protein 
quantified in UCYN-A (fig. S6C). We also re- 
covered 476 additional B. bigelowii proteins 
that contained this sequence but were not de- 
tected in any proteomes. It appears that this se- 
quence is a distinctive transit peptide, analogous 
to those used in protein import in chloroplasts 
(30), mitochondria (37), and the chromatophore 
(32). This UCYN-A transit peptide (uTP) presum- 
ably directs proteins to UCYN-A and therefore 
may constitute a marker that can identify UCYN- 
A-localized proteins (table S5). Combining the 
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detected B. bigelowii-encoded UCYN-A proteins 
with those predicted on the basis of the HMM 
search yielded 900 proteins potentially comple- 
menting the 1186 encoded by the UCYN-A 
genome (Fig. 2D). As previously reported, 
there is no evidence of gene transfer from 
UCYN-A to the B. bigelowii nuclear genome 
(12), and these 900 proteins are primarily of 
eukaryotic origin (table S6). 

Imported B. bigelowii-encoded proteins com- 
pleted some missing steps in biosynthetic path- 
ways that are incomplete and only partially 
encoded in the UCYN-A genome. For example, 
the UCYN-A genome lacks genes required for 
biosynthesis of certain amino acids, nucleotides, 
vitamins, and cofactors (16, 24). B. bigelowii 
proteins predicted to localize to UCYN-A (by 
differential protein abundance or the presence 
of a uTP sequence) were able to complete the 
pathway for biosynthesis of threonine and 
contribute to the incomplete proline and 
serine biosynthetic pathways (Fig. 3A and data 
S3 and S7). The UCYN-A genome is also miss- 
ing orotate phosphoribosyltransferase (PyrE), 
which is required for de novo biosynthesis 
of pyrimidine nucleotides (33). B. bigelowii- 
encoded PyrE was found to be enriched in 
UCYN-A samples and contained a uTP (Fig. 3A). 
Both UCYN-A and B. bigelowii encode dihydro- 
orotase (PyrC) required for pyrimidine biosyn- 


UCYN-A and uTP-containing B. bigelowii proteins in day and night UCYN-A 
samples. Error bars show +1 SD. Asterisk indicates significant increase in 
daytime abundance versus night (DESeq2 Wald test, P < 0.05). NifH, 
dinitrogenase reductase; PetH, ferredoxin NADP reductase; Fd, ferredoxin; 

PC, plastocyanin; Prxs, peroxiredoxin; FIdA, flavodoxin; CYP, cytochrome P450; 
Cry, cryptochrome; HemeO, heme oxygenase; EF-Tu, elongation factor thermal 
unstable; Fmt, methionyl-tRNA formyltransferase. Fd, FldA, CYP, and Cry 
show the sum of all detected proteins sharing these annotations. 
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thesis, and both were enriched in UCYN-A and 
present in similar quantities. The bifunctional 
enzyme dihydrofolate reductase-thymidylate syn- 
thase (DHFR-TS) is targeted to UCYN-A with a 
uTP, and it is the only enzyme missing from the 
tetrahydrofolate biosynthetic pathway (Fig. 3A). 
These findings suggest that the evolutionary 
reduction of the UCYN-A genome was likely 
influenced by redundancies in the B. bigelowii 
genome. 

Many of the most abundant B. bigelowii pro- 
teins found in UCYN-A are known to support 
N; fixation in other diazotrophs. For instance, 
flavodoxin is the preferred electron donor to 
nitrogenase in a free-living relative of UCYN-A 
(34), and both ferredoxin and flavodoxin are 
known electron donors to nitrogenase (35, 36). 
However, the UCYN-A genome lacks flavodoxin 
altogether, which has led to the assumption 
that ferredoxin is responsible for nitrogenase 
reduction (27, 24). A B. bigelowii—encoded flavo- 
doxin (FIdA) containing a uTP was among the 
most common B. bigelowii proteins in UCYN-A 
during the day (Fig. 3B and data S4 and S5). 
UCYN-A contains five ferredoxins, which were 
detected in our proteomes (data S5). Two of 
these were more abundant in UCYN-A during 
the day, including a 2Fe-2S plant type and a 
4¥Fe-4S Nif-specific type. The remaining three 
UCYN-A-encoded ferredoxins were among the 
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few UCYN-A proteins with higher abundances 
in the whole-cell samples, indicating export 
from UCYN-A to B. bigelowii. Ferredoxin is 
known to cross membranes by means of spe- 
cific transporters (37), but we detected no homo- 
logs of these proteins in either B. bigelowii or 
UCYN-A. The dynamics of these electron donors 
and their redox partners are yet unclear, but 
it appears that redox activity is coordinated 
between UCYN-A and B. bigelowii, and it is 
possible that B. bigelowii-encoded flavodoxin 
is an electron donor to UCYN-A nitrogenase. 

Diazotrophs are particularly susceptible to 
damage by reactive oxygen and N species pro- 
duced by high respiratory rates in the vicinity 
of nitrogenase and consequently use antioxidant 
defenses (38). Several B. bigelowii proteins in- 
volved in mediating oxidative damage were 
found in UCYN-A. Cytochrome P450s (CYPs) 
are heme-thiolate-containing oxygenases that 
serve diverse functional roles (39), and certain 
cytochrome P450s are involved in the biosyn- 
thesis of antioxidant flavonoids (40). Out of 
24 B. bigelowii CYP protein sequences, 10 con- 
tained a uTP sequence including all five CYP 
proteins detected by mass spectrometry. All of 
these were more abundant in UCYN-A sam- 
ples, and four of the five detected CYPs were 
more abundant during the day, when N, fixa- 
tion is occurring (Fig. 3B and data S4). All uTP- 
containing CYPs are class I cytochrome P450s 
predicted to use a ferredoxin redox partner 
(4D. B. bigelowii-encoded glutathione dehydro- 
genase, which is involved in the glutathione 
antioxidant response (42), was detected to be 
more abundant in UCYN-A samples and con- 
tains a uTP (data S4). Also abundant in UCYN-A 
was a B. bigelowii-encoded heme oxygenase 
(Heme0O), which could be participating in heme 
catabolism for Fe acquisition or in the antioxi- 
dant defense of nitrogenase (34, 43). Diazotrophs 
that are actively fixing N, also have elevated 
levels of biochemical reactions fueled by respi- 
ration and that involve strong reducing po- 
tentials (38). The data show that in UCYN-A, 
this activity appears to also be accompanied by 
import of protective B. bigelowii proteins (Fig. 3B). 

Other imported proteins are potentially 
involved in regulation of UCYN-A by the 
B. bigelowii cell. Methionine-tRNA formyl- 
transferase (Fmt) is critical for initiating trans- 
lation in prokaryotes, mitochondria, and plastids 
of eukaryotes through import of a nuclear- 
encoded protein (44). The fmt gene is absent 
from the UCYN-A genome, but a B. bigelowii- 
encoded Fmt with a uTP is present in the 
UCYN-A proteome, particularly during the day 
(Fig. 3B and data S6). 

The elongation factor Tu (EF-Tu) is a highly 
abundant bacterial protein essential for protein 
synthesis in prokaryotes but is also known to 
localize to the cell surface, where its functions 
are related to pathogenesis (45). B. bigelowii 
encodes an EF-Tu protein likely horizontally 
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acquired from Alphaproteobacteria, which was 
very highly abundant in UCYN-A. This protein 
does not contain a uTP sequence, which may be 
an indication that it adheres to the outer mem- 
brane rather than being imported. UCYN-A also 
encodes its own EF-Tu, which was also highly 
abundant in the UCYN-A proteome. Bacterial 
EF-Tu proteins are known to be shared be- 
tween different species (46) and to bind pro- 
teins of their symbiotic partners (47). This 
protein in UCYN-A likely mediates interac- 
tions with the surrounding B. bigelowii cell 
(Fig. 3B and data S6). 

As previously discussed (Fig. 1), the UCYN-A 
light-dark and division cycles are important 
for maintaining symbiosis. Circadian rhythms 
are important in many cyanobacteria for daily 
cycles and cell division. UCYN-A lacks two of the 
three kai genes (16, 48), which are responsible 
for generation of circadian rhythms in other 
cyanobacteria (49). In algae and higher plants, 
nuclear-encoded cryptochrome (Cry) proteins 
are known to regulate chloroplast gene ex- 
pression and coordinate circadian cycles across 
organelles (50). Two B. bigelowii-encoded Cry 
proteins, both containing uTP sequences, were 
highly abundant in UCYN-A (Fig. 2B and data 
S6). Their presence in UCYN-A is a further 
indication of B. bigelowii control over funda- 
mental UCYN-A diel processes such as cell 
cycle progression. 

UCYN-A thus has a mosaic proteome made 
up of proteins encoded by its own genome and 
those from the B. bigelowii genome that con- 
tain likely signal sequences for targeting and 
transport into UCYN-A. Many of these proteins 
are isofunctional with proteins lost during 
reduction of the UCYN-A genome, and some 
are redundant with existing UCYN-A proteins. 
The essential nature of some of these proteins 
suggests that control over translation and repli- 
cation has been transferred to B. bigelowii, 
which also appears to perform a substantial 
portion of UCYN-A protein synthesis. Most of 
these proteins are retargeted eukaryotic proteins, 
as is the case in the Paulinella chromatophore 
(32, 51), which is consistent with the interpre- 
tation that both are in an early stage of organ- 
ellogenesis arising from a similar duration of 
endosymbiosis (table S7) (52, 53). 


UCYN-A as a nitrogen-fixing organelle 


The relationship between UCYN-A and 
B. bigelowii has been assumed to be symbiotic 
since its initial discovery (8, 11, 54). This study 
shows that UCYN-A has evolved beyond endo- 
symbiosis and has characteristics of a Nə-fixing 
organelle, or “nitroplast,” an organelle that has 
previously been hypothesized (57). Such an 
organelle would be an N>-fixing analog of pri- 
mary carbon-fixing plastids, which appear to 
have only evolved twice, the first being the 
cyanobacterial endosymbiont ancestor of the 
Archaeplastida (4, 29), and the second being 


the photosynthetic organelle of the protist 
Paulinella resulting from the endosymbiosis 
between Paulinella and a cyanobacterium re- 
lated to Synechococcus (55). Genome reduc- 
tion is a characteristic of both endosymbiosis 
and organellogenesis (56). There are now clear 
parallels among evolution of mitochondria, 
chloroplasts, the chromatophore, and UCYN-A 
with respect to genome reduction, protein 
import, and size relationship to the cell. The 
Paulinella chromatophore and UCYN-A appear 
to both be in an intermediate stage of organ- 
ellogenesis and have undergone gene loss re- 
sulting in genomes of ~1 to 1.5 Mb. However, 
UCYN-A has undergone more-extensive gene 
loss, as No-fixing unicellular cyanobacteria typi- 
cally have genomes approaching 5 Mb, compared 
with 3 Mb for the non-N,-fixing cyanobacteria 
(such as Synechococcus), which are the proposed 
ancestors of the Paulinella chromatophore, 
and ~3 Mb for the diatom spheroid bodies 
(51). UCYN-A also appears to be more efficient, 
with doubling times of ~1.4 days (fig. S3C), com- 
pared with 5 to 7 days for the chromatophore 
(4). Thus, UCYN-A might be further along the 
spectrum of organellogenesis than the Paulinella 
chromatophore (4) or the diatom spheroid 
bodies (57). 

Model systems that provide a more complete 
picture of the events that occur along the 
path of organellogenesis are exceedingly rare. 
Although it has been suggested that there is 
no reason why an N,-fixing organelle could not 
be maintained in the cell, such organelles have 
yet to be described (57). The findings of this 
study, along with the regulation of spheroid 
body size (17) and coevolution of B. bigelowii 
and UCYN-A (53), suggest that the evolution 
of the nitroplast is underway and establish 
B. bigelowii as an important model system of 
great interest for understanding organello- 
genesis, with implications for contemporary 
efforts to engineer N>-fixing capabilities into 
agricultural plants (10, 57). 
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